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Abstract. Recent measurements of inclusive and semi-inclusive deep inelastic scattering in electron-proton 
collisions at HERA are reviewed. These measurements are used to determine the parton distribution functions 
(PDFs) of the proton, a necessary input to theory predictions for hadron colliders. An introduction to the PDF 
determination with an emphasis on HERA PDFs is presented. Theory predictions based on HERAPDF are com- 
pared to a selection of recent LHC and Tevatron measurements. The impact of jet and charm production mea- 
surements on the PDFs is discussed. 



1 Introduction 



A deep understanding of the proton structure is one of the 
most important topics in modern particle physics. A pre- 
cise knowledge of the Parton Distribution Functions (PDFs) 
of the proton is essential in order to make predictions for 
the Standard Model and beyond the Standard Model pro- 
cesses at hadron colliders. 

The cross sections of processes in proton-(anti)proton 
collisions are factorized by a convolution of the matrix ele- 
ment of parton-parton interaction and the proton structure. 
The latter is described by parton density functions (PDFs). 
A PDF, fi(x, Q 2 ), represents the probability of finding in 
the proton a parton ; (quark or gluon) carrying a fraction 
x of the proton momentum with Q being the energy scale 
of the hard interaction. In case of proton-(anti)proton in- 
teractions PDFs of both protons enter multiplicatively into 
the calculation of the process cross section. Therefore the 
precision of the PDFs is of particular importance for accu- 
rate cross-section predictions. In the last decades, the mea- 
surements of lepton-nucleon and proton-antiproton scat- 
tering have been used to determine the proton PDFs. At 
low to medium x the PDFs are constrained by HERA data. 
The measurements at fixed target experiments and Teva- 
tron contribute mainly at high x. The recent precise data 
from Tevatron and the LHC experiments have the potential 
to improve the precision on the PDFs further. 



2 Proton Structure and DIS at HERA 



The knowledge of the proton PDFs is obtained to a large 
extent from the measurements of the structure functions 
in deep inelastic scattering (DIS) experiments. In Fig. [T] 
the diagram of DIS is represented. The lepton is scattered 
off the nucleon via the exchange of a y or Z° -boson (neu- 
tral current, NC, process) or via the exchange of a W*- 
boson(charged current, CC). Here the scattering of an elec- 
tron (or positron) off the proton is discussed. 



The NC (and similarly CC) cross section can be ex- 
pressed in terms of the generalized structure functions: 



j2 e P 
d(r NC 



2na 



dxdQ 2 xQ 4 



Ft + Y-xF* 



where Y ± = 1 ±(1 -y) 2 with y being the transferred fraction 
of the lepton energy. The (generalized) structure function 
Fi (F2) is the dominant contribution to the cross section, 
xFt, is important at high Q 2 and Fl is sizable only at high y. 
In the framework of perturbative QCD the structure func- 
tions are directly related to the parton distribution func- 
tions, i.e. in leading order (LO) F2 is the momentum sum 
of quark and anti-quark distributions, F2 ~ x 2 e 2 (q + q), 
and XF3 is related to their difference, xFi w x £ 2-e q a q {q - 
q). At higher orders, terms related to the gluon density dis- 
tribution (a s g) appear. In analogy to neutral currents, the 
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Fig. 1. Diagrams of neutral NC and charged CC current deep 
inelastic scattering processes. The symbols denote the particles, 
the label "X" denotes the hadronic final state. 



inclusive CC ep cross section can be expressed in terms of 
structure functions and in LO the e + p and e~p cross sec- 
tions are sensitive to different quark densities: 



^cc = x ^ u + + (1 ~~ y) 2 x[d + s] 

^cc ~ x [ u + c ] + (1 _ yf'AA + s]. 



At HERA at DESY in Hamburg, electrons (or positrons) 
were collided with protons at centre-of-mass energies V* = 
225 - 3 18 GeV. The measurements of the NC and CC cross 
sections from HERA extend the kinematic regime in Q 2 
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by more than two orders of magnitude with respect to the 
fixed target experiments and cover the wide x range from 
1(T 7 to 0.7. At the HERA collider experiments, HI and 
ZEUS, the cross sections of NC and CC DIS are measured 
with high precision. The measurements of the two experi- 
ments are combined and are further used to determine par- 
ton distribution functions HERAPDF J2J . 



3 HERAPDF 

The PDFs are determined from the structure function mea- 
surements using the corresponding coefficient functions cal- 
culated to a certain order in perturbative QCD (pQCD). 
The structure functions, and in turn the PDFs, depend on x 
and Q. The x-dependence of the parton distributions is not 
yet calculable in pQCD and has to be parametrized at a cer- 
tain starting scale go- The dependence on Q is described 
by the DGLAP evolution equations [TJ. Starting from a 
parameterisation of the PDFs at a starting scale, either by 
making ad-hoc assumptions on their analytical form or by 
using the neural-net technology, fits to various sets of ex- 
perimental data, with HERA DIS data being the backbone, 
are performed within the DGLAP evolution scheme. The 
resulting PDFs depend on the order in which the perturba- 
tive QCD calculation is performed, the assumptions about 
the PDF parametrization, the treatment of heavy quarks, 
the choice for the value of a s (M z ) and the treatment of the 
uncertainties. The data sets included in the PDF fit and the 
consistency of these data sets determines the experimental 
uncertainty of the PDFs. 



HI and ZEUS 
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Fig. 2. Inclusive DIS cross sections for NC in e ± collisions at 
HERA. The measurements of the HI and ZEUS experiments are 
combined. Open (closed) symbols represent e~ p (e + p) scattering. 
The shaded curves represent QCD prediction based on HERA- 
PDF1.5NLO. 

The parton distributions HERAPDF [2 1 are determined 
using only combined HERA DIS data, where the corre- 
lations of the systematic uncertainties are properly taken 
into account. This allows the usage of the conventional x 1 
tolerance of Ax 2 = 1. Since this QCD analysis is solely 
based on ep data, the PDFs do not depend on the approach 
for nuclear corrections needed for fixed target data. Several 
phenomenological schemes of heavy quark treatment can 



be used in the HERAPDF approach. Therefore direct tests 
of the models are possible. The full statistics of the HERA 
inclusive CC and NC data are used for NLO and NNLO 
QCD fits resulting in HERAPDF 1.5 0. As an example, 
the combined NC cross sections are shown in Fig. [2] to- 
gether with QCD prediction based on HERAPDF1.5NLO. 

The QCD analysis HERAPDF1.5 follows the formal- 
ism, model and paramatrisation assumptions as reported 
in |2). The QCD predictions for the structure functions 
are obtained by solving the DGLAP evolution equations 
at NLO (or NNLO) in the MS scheme with the renormal- 
isation and factorisation scales chosen to be Q 2 . The QCD 
predictions for the structure functions are obtained by the 
convolution of the PDFs with the NLO coefficient func- 
tions calculated using the general mass variable flavour 
number RT scheme [4 |. For the parametrisation of PDFs at 
the input scale the generic form xf(x) = Ax B (l - x) c (l + 
Ex 2 ) is used. The parametrised PDFs are the gluon distri- 
bution, the valence quark distributions and the M-type and 
(f-type anti-quark distributions. The normalisation parame- 
ters A are constrained by the quark number and momentum 
sum-rules. 
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Fig. 3. The parton distribution functions from HERAPDF1.5 
NNLO. The gluon and sea distributions are scaled down by a 
factor of 20. The experimental, model and parametrisation un- 
certainties are shown. 

In Fig. [3]the parton distributions HERAPDF1 .5NNLO 
at Q 2 - 10000 GeV 2 are shown. In addition to the ex- 
perimental uncertainties, the variation of model inputs and 
parametrisation in the determination of HERAPDF are per- 
formed and provided as additional eigenvectors. The model 
uncertainties are evaluated by varying the input assump- 
tions on minimum Q 2 of the data used in the fit, the stran- 
geness fraction and the masses of heavy quarks. The para- 
metrisation uncertainty is formed by an envelope of the 
maximal deviations from the central fit varying parametri- 
sation assumptions. HERAPDF1.5NLO and NNLO sets 
are the recommended HERA PDFs to be used for the pre- 
dictions of processes at the LHC. The corresponding eigen- 
vectors are available Q. 



4 Benchmarking HERAPDF 

The PDFs are intrinsic properties of the proton and are 
therefore process-independent. Cross section predictions 
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for processes in proton-(anti)proton collisions can be ob- 
tained using HERAPDF, evolved in Q 2 using DGLAP equa- 
tions. 

The measurements of jet production at hadron collid- 
ers is an important instrument to probe PDFs at high x and 
also provide additional constraints on the value as(Mz). 
In Fig. [4] the jet production cross sections as measured by 
DO experiment |6| is presented. The measurement is con- 
fronted with the QCD prediction at NLO 17151 based on 
HERAPDF1.5NLO. The data is very well described by this 
prediction. In Fig.[5]the jet measurement from ATLAS ex- 



Tevatron inclusive jet cross sections 




Fig. 4. Jet production cross section as a function of the jet trans- 
verse momentum for different ranges of pseudorapidity, as mea- 
sured by the DO collaboration. The data are represented by closed 
symbols. The measurement is compared to the QCD calcula- 
tion at NLO based on HERAPDF1.5NLO. The total PDF uncer- 
tainty and hadronisation corrections on the prediction is shown 
as shaded bands. 



periment [9] in a central rapidity bin is shown in compar- 
ison with NLO predictions using HERAPDF1.5NLO to- 
gether with several other PDFs. The QCD prediction using 
HERAPDF1.5NLO describes the data very weU. 



The W-boson muon asymmetry as measured by the CMS 
experiment IfTUl is shown in Fig. [6] The measurement is 
compared to NLO predictions 1 12] obtained using HERA- 
PDF1.5NLO, MSTW08 El and CTlOWd PDFs. The 
prediction based on HERAPDF 1.5NLO describes the data 
well. 
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Fig. 6. The W muon charge asymmetry as measured by the 
CMS experiment. The measurement ( closed symbols) is compared 
to the NLO prediction (J2jl using HERAPDF 1 . 5 NLO (shaded 
band), MSTW08NLO (dotted line) and CT10W (dashed line). 



Top quark pair production at the LHC probes the gluon 
density at high x. In Fig.[7]the cross-section measurement 
of top pair production is shown as a function of the top- 
quark pole mass in comparison to approximate NNLO cal- 
culations lfT6TT71 based on HERAPDF1 .5NNLO. The the- 
ory uncertainty accounts for the variation of the QCD scales, 
PDFs error and the variation of as (Mz) in the PDF. For the 
PDF uncertainty of HERAPDF 1 .5NNLO, only the eigen- 
vectors for experimental errors are used. The predictions 
describe the data very well. 
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Fig. 5. Inclusive jet production cross section as a function of the 
jet transverse momentum, as measured by the ATLAS collabora- 
tion in the rapidity range < y < 0.3 The jets are identified using 
the anti-k, algorithm with R = 0.4. The data are represented in a 
ratio (stars) to the QCD prediction, using CTEQ6.6 II IV as a ref- 
erence PDFs. The central value for the QCD calculation at NLO 
based on HERAPDF1 .5NLO is represented by closed circles sur- 
rounded by the error band shown as the hashed area. 



Production of electroweak bosons provides important 
constraints on the light quark distributions. For example, 



the W lepton charge asymmetry A/(W) = 



(<T w ++cr w -) 



is sensitive to the valence u and d quark ratio. 
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Fig. 7. The top-pair production cross section measured by the 
CMS experiment (closed square) shown at the assumption on 
the top mass, used in the analysis. The mass dependence of the 
tt cross section according to approximate NNLO QCD predic- 
tions M6V and M7V are represented by the shaded and hashed 
band, respectively. The dependence of the experimental measure- 
ment on the assumption on the m, in the simulation used for effi- 
ciency and detector corrections is shown in by light shaded band. 
The closed circle represents the cross section measurement, cor- 
rected for the top pole mass, extracted using the calculation M6V . 
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5 Global benchmarking excercise 



6.1 Including jet data in the PDF fit: HERAPDF1 .6 



Presently, the determination of PDFs is carried out by sev- 
eral groups, namely MSTW flj|, CTEQ HI), NNPDF ||20), 
HERAPDF @, AB(K)M ED and GJR E2|. The large 
number of PDF parameters and their treatment in the fit- 
ting procedure within the different groups results in differ- 
ences of the PDFs provided. In order to study these dif- 
ferences, a benchmarking exercise is being carried out by 
the PDF4LHC working group l23l formed by the mem- 
bers of the PDF fitting groups mentioned above. As an 
example, the NLO prediction for the Higgs cross section 
(M H = 120 GeV) at the LHC is shown in Fig.|8]for differ- 
ent PDF sets as a function of as(Mz). For different PDF 
groups not only the value of a s (M z ), but also the run- 
ning of the strong coupling is different, resulting in dif- 
ferent cross section predictions. The HERAPDF is an ac- 
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Fig. 8. NLO Higgs cross section predictions (M H - 
using different PDFs at the LHC with yfs = 7 TeV. 
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tive participant in the benchmarking exercise. In contrast to 
other PDF groups, HERAPDF is not restricted to one par- 
ticular heavy flavour treatment scheme, several schemes 
are implemented and can be tested. Also, by providing the 
PDF eigenvectors for model parameter and parametriza- 
tion variations, HERAPDF allows for tests of specific pa- 
rameterisation and model assumptions during the QCD ana- 
lysis of different data sets. In the following, the inclusion of 
semi-inclusive DIS data in the QCD analysis HERAPDF 
and the impact of these data on assumptions on as(Mz) 
and the charm quark mass value in the PDF fit is discussed. 



In addition to the combined HERA inclusive DIS data as 
used in the QCD analysis HERAPDF1.5, HI and ZEUS 
measurements of jet production cross sections ll24l are in- 
cluded in the PDF fit. The resulting parton distributions 
HERAPDF1.6 |25| are determined using a fixed value of 
as(M z ) and also using a s (M z ) as a free parameter in the 
fit. The impact of the inclusion of jet data in the PDF fit 
on the gluon distribution and the value of as is demon- 
strated in Fig. [9] Here, the PDFs obtained using the in- 
clusive data only (HERAPDF1.5) and the PDFs resulting 
from including the jet data (HERAPDF1.6) are determined 
using as(Mz) as a free parameter in the QCD analysis. 
In case of the simultaneous fit of PDFs and a s in HERA- 
PDF1.5, the uncertainties on the gluon PDF becomes large 
at low x but as soon as the jet data are included, the corre- 
lation between the gluon PDF and a s (Mz) is reduced, re- 
sulting in significantly reduced uncertainties on the gluon 
PDF. In Fig. 10 the quality of the PDF fit in terms of x 1 
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Fig. 9. Left panel: The parton distribution functions from HER- 
APDF1.5. Right panel: the parton distribution functions from 
HERAPDF1.6 (with HERA jet data included in the fit). In both 
cases, the QCD analysis is performet treating a s (Mz) as a free 
parameter in the fit. The PDFs are presented for Q 2 =10 GeV 2 . 



is represented as a function of the assumption on the value 
of as(Mz). In case of HERAPDF1.5, where only inclusive 
data are used, a very shallow minimum in the x 1 distri- 
bution is observed. The inclusion of the jet measurements 
in the fit results in the clear minimum, which allows the 
simultaneous determination of the PDFs and as(Mz). 
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6 Semi-inclusive data in HERAPDFs 



Semi-inclusive measurements in DIS like jet and heavy 
flavour production, provide additional constraints on the 
PDFs when included into the QCD analysis together with 
inclusive DIS data. The jet production is directly sensitive 
to both the gluon distribution in the proton and the strong 
coupling el's ■ Therefore, including the jet data in the QCD 
analysis can help disentangling the effects from the gluon 
and as in the PDF fit. Similarly, charm and beauty pro- 
duction in ep collisions provide direct access to the gluon 
distribution in the proton, which also depends on the as- 
sumption of the charm and beauty mass values used in the 
PDF fit. 
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Fig. 10. Distribution of x' for PDF fit as a function of the as- 
sumption on the as (M z ) value. The dashed line corresponds to 
HERAPDF1.5, where only inclusive DIS data are used. The solid 
line represents HERAPDF1.6, where the jet data are included. 
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Avalueof a s (M z ) = 0.1202 + 0.0013(exp) ± 0.0007 
(mod/param) ±0.0012(hadronisation)^Qy^(scale) is deter- 
mined ll25l . This result is in very good agreement with 
different results of as determination at HERA and with 



the world average as shown in Fig. 11 It is important to 
note, that the dominant uncertainty arises from the varia- 
tion of the renormalization and factorisation scales in the 
NLO calculation for the jet cross sections. This variation is 
used to mimic the effect of the missing contribution from 
higher orders. 
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Fig. 11. The summary of as(Mz) determination results using the 
jet production at HERA as compared to the world average. The 
upper point corresponds to the simultaneous determination of 
a s (M z ) and the PDF, as described in the text. The experimental 
uncertainties are represented by solid lines, the theory uncertain- 
ties are shown by dashed lines. 



values of the PDF fit including the charm data are deter- 
mined as a function of the input values of charm quark 
mass, m™ od , using different heavy quark schemes, as shown 
in Fig. [H] 

Different assumptions on m"' od in VFN schemes impact 
the charm contribution to the sea quark distribution and 
thus affect the composition of xU(x) from the xu(x) and the 
xc(x) contributions. These in turn influence the value of the 
W ± and Z cross section predictions at LHC. In Fig. 13 the 



NLO prediction |12| for the W + production cross section 
is shown, using parton distributions evaluated with differ- 
ent assumptions on m™ od in various heavy quark schemes. 
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Fig. 12. Comparison of the x 1 distributions of fits to the inclusive 
HERA I + Ff data using different heavy flavour schemes repre- 
sented as lines of different styles. 



6.2 Charm quark measurements in the PDF fit. 

The factorization scheme, used for the PDF determination 
depends on the assumption on the number of flavours in 
the proton, which varies depending on the value of the 
scale, which has to be compared to the threshold, at which 
charm and beauty quarks can be treated as partons. This 
threshold is determined by the mass of charm and beauty 
quarks. Therefore, the treatment of heavy quarks and the 
assumptions on their masses have particular importance 
in the QCD analysis of the proton structure. Different ap- 
proaches to treat heavy quark (heavy quark schemes) are 
used by different PDF fitting groups, corresponding to dif- 
ferent treatment of mass terms in perturbative calculations, 
but also implying differences in the interpretation and as- 
sumptions on the values of the heavy quark masses. Mea- 
surements of charm and beauty production can help con- 
straining some of these assumptions. The charm contribu- 
tion, F c 2 , to the proton structure function Fo is measured 
at HI and ZEUS using different charm tagging techniques. 
These measurements are combined [27 1 taking into account 
the correlations of the systematic uncertainties. The com- 
bined F c 2 data are included in the QCD analysis of the in- 
clusive DIS cross sections, and the effect on the PDFs us- 
ing different assumptions on the charm quark mass, m c , is 
studied 11291 . The sensitivity of the PDF fit to the m c value 
when using combined F c 2 is used to constrain the assump- 
tions on m c in different heavy quark schemes OTI . The^f 2 
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Fig. 13. NLO prediction ofcr w + at the LHC for ^[s = 7 TeV as 
a function ofm""" 1 in the input PDF. The lines show predictions 
for different VFN schemes. The stars show the predictions ob- 
tained with the optimal value ofm""*' used in a given scheme. The 
dashed horizontal lines indicate the range ofo~ w +, determined for 

m ,nod _ m mod ( opt ) 



Taking into account the whole spread of cross section pre- 
dictions using the studied schemes, an uncertainty of 7% 
on the W + production cross section arises due to assump- 
tion on m" 1od in the PDF. However, when using the optimal 



12 



values, m" wd (opt), corresponding to minima from Fig 
as constrained by HERA charm data, this uncertainty is re 
duced to 1%. 
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7 Summary 

Precision of the parton distribution functions is essential 
for accurate predictions of cross sections of the procecces 
at hadron colliders. The proton PDFs are determined using 
the experimental data of DIS and proton-proton collisions. 
Combined data of HERA collider experiments provide the 
most precise constraint on the PDFs at small and medium 
x. HERAPDF is one of the modern QCD analyses in which 
PDFs are determined. The advantages of these PDFs is no 
need for nuclear corrections (in contrast to PDFs using the 
fixed target data), consistent treatment of the systematic 
uncertainties of the experimental data and implementation 
of several phenomenological approaches of heavy flavour 
treatment. Currently, HERAPDF 1.5 at NLO and NNLO 
are among the recommended parton densities for predic- 
tions of LHC cross sections. Recent developments in the 
HERAPDF fits include the QCD analyses of HERA inclu- 
sive DIS data together with jet and charm measurements. 
The inclusion of the jet measurements in the HERAPDF 
analysis reduces the correlation between the gluon distri- 
bution and the strong coupling constant. In such a fit, the 
PDF is determined together with the as(Mz) value. The 
resulting as(Mz) value is in a very good agreement with 
the world average and its precision is limited by the miss- 
ing NNLO calculation for jet production. The inclusion of 
the charm data reduces the correlation between the gluon 
density and the value of the charm mass used in different 
schemes of heavy flavour treatment in the PDF fit. In par- 
ticular, a proper choice of the charm quark mass value is 
important for accurate QCD predictions of W and Z bo- 
son rates at the LHC. The QCD predictions based on the 
HERAPDF1.5 describe the measurements at Tevatron and 
the LHC very well. With increasing precision of the LHC 
data, particular processes like W-boson, jet or top-pair pro- 
duction will provide additional constraints on the PDFs. 

The open source code for QCD analysis of different 
data sets HERAFitter [33] is released by the HI and ZEUS 
collaborations. The program aims for implementation of 
all available schemes for heavy flavour treatment. HERA- 
Fitter is used in the ATLAS and CMS experiments to study 
the impact the electroweak boson production, jet produc- 
tion and top quark production on the proton PDFs. 
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